AITopics | multiple set

Collaborative Refining for Learning from Inaccurate Labels

Neural Information Processing Systems | Mar-21-2026, 23:28:48 GMT

This paper considers the problem of learning from multiple sets of inaccurate labels, which can be easily obtained from low-cost annotators, such as rule-based annotators. Previous works typically concentrate on aggregating information from all the annotators, overlooking the significance of data refinement. This paper presents a collaborative refining approach for learning from inaccurate labels. To refine the data, we introduce the annotator agreement as an instrument, which refers to whether multiple annotators agree or disagree on the labels for a given sample. For samples where some annotators disagree, a comparative strategy is proposed to filter noise.
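
The annotator-agreement signal described above can be illustrated with a minimal sketch. It only partitions samples by whether all annotators assign the same label; the function name `split_by_agreement` and the toy label matrix are hypothetical, and the paper's comparative noise-filtering strategy is not reproduced here because the abstract does not specify it.

```python
from typing import List, Sequence, Tuple


def split_by_agreement(
    labels: Sequence[Sequence[int]],
) -> Tuple[List[int], List[int]]:
    """Partition sample indices by annotator agreement.

    labels[i][r] is the label that annotator r gave to sample i
    (a hypothetical layout chosen for this illustration).
    """
    agreed: List[int] = []
    disagreed: List[int] = []
    for i, row in enumerate(labels):
        # A sample counts as "agreed" only if every annotator gave the same label.
        if len(set(row)) == 1:
            agreed.append(i)
        else:
            disagreed.append(i)
    return agreed, disagreed


# Toy data: 4 samples, 3 low-cost annotators each.
labels = [
    [1, 1, 1],
    [0, 1, 0],
    [0, 0, 0],
    [1, 0, 1],
]
agreed, disagreed = split_by_agreement(labels)
print(agreed, disagreed)  # [0, 2] [1, 3]
```

In this sketch the disagreement set (samples 1 and 3) is where a noise-filtering step would then be applied.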

annotator, artificial intelligence, machine learning, (7 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Collaborative Refining for Learning from Inaccurate Labels

Neural Information Processing Systems | Feb-17-2026, 07:17:18 GMT

For samples where some annotators disagree, a comparative strategy is proposed to filter noise.

annotator, machine learning, natural language, (16 more...)

Country:

Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
(2 more...)

a8809ae67a7aad49a64d615468d72808-Paper-Conference.pdf

Neural Information Processing Systems | Oct-10-2025, 12:35:56 GMT

annotator, dataset, experiment, (14 more...)

Country:

Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
(3 more...)

Collaborative Refining for Learning from Inaccurate Labels

Neural Information Processing Systems | May-27-2025, 11:58:38 GMT

This paper considers the problem of learning from multiple sets of inaccurate labels, which can be easily obtained from low-cost annotators, such as rule-based annotators. Previous works typically concentrate on aggregating information from all the annotators, overlooking the significance of data refinement. This paper presents a collaborative refining approach for learning from inaccurate labels. To refine the data, we introduce the annotator agreement as an instrument, which refers to whether multiple annotators agree or disagree on the labels for a given sample. For samples where some annotators disagree, a comparative strategy is proposed to filter noise.

annotator, collaborative refining, inaccurate label, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.42)

Clustering sequence sets for motif discovery

Neural Information Processing Systems | Apr-6-2023, 13:52:03 GMT

Most existing methods for DNA motif discovery consider only a single set of sequences to find an over-represented motif. In contrast, we consider multiple sets of sequences, grouping sets associated with the same motif into a cluster under the assumption that each set involves a single motif. Clustering sets of sequences yields clusters of coherent motifs, improving the signal-to-noise ratio or enabling us to identify multiple motifs. We present a probabilistic model for DNA motif discovery that identifies multiple motifs by searching for patterns shared across multiple sets of sequences. Our model infers cluster-indicating latent variables and learns motifs simultaneously, with the two tasks interacting with each other.

motif, motif discovery, sequence, (4 more...)

Technology: Information Technology > Artificial Intelligence (0.45)

Ensemble knowledge distillation of self-supervised speech models

Huang, Kuan-Po, Feng, Tzu-hsun, Fu, Yu-Kuan, Hsu, Tsu-Yuan, Yen, Po-Chieh, Tseng, Wei-Cheng, Chang, Kai-Wei, Lee, Hung-yi

arXiv.org Artificial Intelligence | Feb-24-2023

Distilled self-supervised models have shown competitive performance and efficiency in recent years. However, there is a lack of experience in jointly distilling multiple self-supervised speech models. In our work, we performed Ensemble Knowledge Distillation (EKD) on various self-supervised speech models such as HuBERT, RobustHuBERT, and WavLM. We applied two different aggregation techniques, layerwise-average and layerwise-concatenation, to the representations of different teacher models and found that the former was more effective. On top of that, we proposed a multiple prediction head method for student models to predict different layer outputs of multiple teacher models simultaneously. The experimental results show that our method improves the performance of the distilled models on four downstream speech processing tasks (Phoneme Recognition, Speaker Identification, Emotion Recognition, and Automatic Speech Recognition) in the hidden-set track of the SUPERB benchmark.
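
The two aggregation techniques named in this abstract can be sketched roughly as follows. This is an illustration under simplifying assumptions, not the authors' implementation: each teacher is represented as a list of per-layer feature vectors (real models emit per-frame sequences), all teachers are assumed to have the same number of layers, and averaging additionally assumes equal feature dimensions. The function names and toy values are hypothetical.

```python
from typing import List

Vector = List[float]


def layerwise_average(teachers: List[List[Vector]]) -> List[Vector]:
    """Average the teachers' feature vectors at each layer.

    teachers[t][l] is teacher t's feature vector at layer l.
    Assumes equal layer counts and equal feature dimensions.
    """
    n = len(teachers)
    num_layers = len(teachers[0])
    return [
        [sum(vals) / n for vals in zip(*(t[l] for t in teachers))]
        for l in range(num_layers)
    ]


def layerwise_concatenation(teachers: List[List[Vector]]) -> List[Vector]:
    """Concatenate the teachers' feature vectors at each layer.

    Only equal layer counts are assumed; the output dimension at each
    layer is the sum of the teachers' feature dimensions.
    """
    num_layers = len(teachers[0])
    return [
        [x for t in teachers for x in t[l]]
        for l in range(num_layers)
    ]


# Toy data: two teachers, two layers, two features per layer.
teacher_a = [[1.0, 2.0], [3.0, 4.0]]
teacher_b = [[3.0, 6.0], [5.0, 8.0]]

avg = layerwise_average([teacher_a, teacher_b])
cat = layerwise_concatenation([teacher_a, teacher_b])
print(avg)  # [[2.0, 4.0], [4.0, 6.0]]
print(cat)  # [[1.0, 2.0, 3.0, 6.0], [3.0, 4.0, 5.0, 8.0]]
```

Averaging keeps the target dimension fixed regardless of the number of teachers, whereas concatenation grows it with each added teacher.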

artificial intelligence, machine learning, teacher model, (16 more...)

2302.12757

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Asia > Taiwan (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Education (0.37)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.72)

FedEx is shutting down its robot delivery program

#artificialintelligence | Oct-21-2022, 15:15:23 GMT

Roxo was announced in 2019 as a collaboration with DEKA, makers of the iBot wheelchair, which used multiple sets of wheels to "walk" up and down stairs, and raise its user from a sitting level to eye-height. Roxo also used multiple sets of wheels to climb steps and curbs. The robot had a top speed of 10mph, a cargo capacity of 100lbs (45kg), and was able to autonomously navigate around cars and pedestrians using cameras and LIDAR sensors. Human operators were used to oversee its movements and steer it manually if necessary.

fedex, multiple set, robot delivery program

Industry: Transportation > Freight & Logistics Services (0.40)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.40)

Clustering sequence sets for motif discovery

Kim, Jong K., Choi, Seungjin

Neural Information Processing Systems | Feb-15-2020, 02:26:12 GMT

Most existing methods for DNA motif discovery consider only a single set of sequences to find an over-represented motif. In contrast, we consider multiple sets of sequences, grouping sets associated with the same motif into a cluster under the assumption that each set involves a single motif. Clustering sets of sequences yields clusters of coherent motifs, improving the signal-to-noise ratio or enabling us to identify multiple motifs. We present a probabilistic model for DNA motif discovery that identifies multiple motifs by searching for patterns shared across multiple sets of sequences. Our model infers cluster-indicating latent variables and learns motifs simultaneously, with the two tasks interacting with each other.

motif, motif discovery, sequence, (4 more...)

Technology: Information Technology > Artificial Intelligence (0.50)

Solving big data's 'fusion' problem

#artificialintelligence | Jul-23-2016, 13:34:33 GMT

As the field of "big data" has emerged as a tool for solving all sorts of scientific and societal questions, one of the main challenges that remains is whether, and how, multiple sets of data from various sources could be combined to determine cause-and-effect relationships in new and untested situations. Now, computer scientists from UCLA and Purdue University have devised a theoretical solution to that problem. Their research, which was published this month in the Proceedings of the National Academy of Sciences, could help improve scientists' ability to understand health care, economics, the environment and other areas of study, and to glean much more pertinent insight from data. The study's authors are Judea Pearl, a distinguished professor of computer science at the UCLA Henry Samueli School of Engineering and Applied Science, and Elias Bareinboim, an assistant professor of computer science at Purdue University who earned his doctorate at UCLA. Big data involves using mountains and mountains of information to uncover trends and patterns.

artificial intelligence, big data, data mining, (13 more...)

Country:

North America > United States > Texas (0.06)
North America > United States > California > Los Angeles County > Los Angeles (0.06)
Africa > Kenya (0.06)

Genre: Research Report > Experimental Study (0.32)

Industry: Health & Medicine (0.52)

Technology:

Information Technology > Artificial Intelligence (0.90)
Information Technology > Data Science > Data Mining > Big Data (0.85)

Clustering sequence sets for motif discovery

Kim, Jong K., Choi, Seungjin

Neural Information Processing Systems | Dec-31-2009

Most existing methods for DNA motif discovery consider only a single set of sequences to find an over-represented motif. In contrast, we consider multiple sets of sequences, grouping sets associated with the same motif into a cluster under the assumption that each set involves a single motif. Clustering sets of sequences yields clusters of coherent motifs, improving the signal-to-noise ratio or enabling us to identify multiple motifs. We present a probabilistic model for DNA motif discovery that identifies multiple motifs by searching for patterns shared across multiple sets of sequences. Our model infers cluster-indicating latent variables and learns motifs simultaneously, with the two tasks interacting with each other. We show that our model can handle various motif discovery problems, depending on how the multiple sets of sequences are constructed. Experiments on three different DNA motif discovery problems demonstrate the model's useful behavior and confirm substantial gains over existing methods that consider only a single set of sequences.

artificial intelligence, bioinformatics, machine learning, (18 more...)

Country: Asia (0.14)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: